Siamese Network for RGB-D Salient Object Detection and Beyond
نویسندگان
چکیده
Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as independent information design separate networks for feature extraction from each. Such schemes can easily be constrained by a limited amount of training data or over-reliance on an elaborately designed process. Inspired the observation that modalities actually present certain commonality in distinguishing objects, novel joint learning densely cooperative fusion (JL-DCF) architecture is to learn both inputs through shared network backbone, known Siamese architecture. In this paper, we propose two effective components: (JL), (DCF). The JL module provides robust saliency exploiting cross-modal via network, while DCF introduced complementary discovery. Comprehensive experiments using 5 popular metrics show framework yields detector with good generalization. As result, JL-DCF significantly advances SOTAs average ~2.0% (F-measure) across 7 challenging datasets. addition, readily applicable other related multi-modal tasks, including RGB-T SOD video SOD, achieving comparable better performance.
منابع مشابه
Local Background Enclosure for RGB-D Salient Object Detection - Supplementary Results
The purpose of this supplementary material is to examine in detail the contributions of our proposed Local Background Enclosure (LBE) feature. A comparison of LBE with the contrast based depth features used in state-of-the-art salient object detection systems is presented. The LBE feature is compared with the raw depth features ACSD [1], DC [3] and a signed version of DC denoted SDC on the RGBD...
متن کاملRGB-D Salient Object Detection Based on Discriminative Cross-modal Transfer Learning
In this work, we propose to utilize Convolutional Neural Networks (CNNs) to boost the performance of depth-induced salient object detection by capturing the high-level representative features for depth modality. We formulate the depth-induced saliency detection as a CNN-based cross-modal transfer problem to bridge the gap between the " data-hungry " nature of CNNs and the unavailability of suff...
متن کاملLearning Graph Matching for Object Detection from Rgb-d Images
We propose an optimization method for estimating parameters in graph-theoretical formulations of the matching problem for object detection. Unlike several methods which optimize parameters for graph matching in a way to promote correct correspondences and to restrict wrong ones, our approach aims at improving performance in the more general task of object detection. In our formulation, similari...
متن کاملFrustum PointNets for 3D Object Detection from RGB-D Data
While object recognition on 2D images is getting more and more mature, 3D understanding is eagerly in demand yet largely underexplored. In this paper, we study the 3D object detection problem from RGB-D data captured by depth sensors in both indoor and outdoor environments. Different from previous deep learning methods that work on 2D RGB-D images or 3D voxels, which often obscure natural 3D pa...
متن کاملSemantic Parsing for Priming Object Detection in RGB-D Scenes
The advancements in robot autonomy and capabilities for carrying out more complex tasks in unstructured indoors environments can be greatly enhanced by endowing existing environment models with semantic information. In this paper we describe an approach for semantic parsing of indoors environments into semantic categories of Ground, Structure, Furniture and Props. Instead of striving to categor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Pattern Analysis and Machine Intelligence
سال: 2021
ISSN: ['1939-3539', '2160-9292', '0162-8828']
DOI: https://doi.org/10.1109/tpami.2021.3073689